BEST AVAILABLE COPY 




BEST AVAILABLE COPY 



Copyright © 1995 by The RockefeUer Univenity Press 
All rights reserved 

Libraiy of Congress Catalos Card Number 95-067143 

ISBN0-8747(MJ57^ 

Printed in the United States of America 



Chapter 1 



Models of Protein Structure 



Structural Models of Na"*-, Ca^"**, and K"*" Channels 



H. Robert Guf and Stewart R. Durell 

Laboratory ofMaAematiad Biology, DCBDQ National Cancer 
Institute, National Institutes ofHeabh, BeAesda, Maryland 20892 

The most serious impediment in the field of ion channel research is the lack of 
detailed^ experimentally determined structural models. These structures are vital for 
understanding haw channels function, how drugs and toxins modulate these func- 
tions, and how naturally occurring genetic mutations lead to disease. Unfortunately, 
the very nature of membrane*bound ion channel proteins makes them extremely 
difficult to study by crystallographic and NMR methods. As a confounding factor, 
funding agencies have been reluctant to sponsor these long*term structure detenhi- 
nation projects, which admittedly have uncertain futures. 

In light of these problems, a large multidisciplinaxy effort has arisen to deter- 
mine structural information about ion channels using more accessible, albeit less 
certain methods. Our contribution to this process has been to combine the available 
experimental data with structure prediction principles to develop molecular models 
of the ion channel proteins. This is an iterative procedure, where the models are used 
to suggest further experiments, and the resulting data are used to refine the models. 
For example, mutagenesis experiments have been veiy effective in verifying many 
features of the early voltage-gated channel models, such as which sequence segments 
span the membrane and form the voltage sensor, inactivation gate, selectivity filter, 
and ligand binding sites (Ouy.aiid Ehirell, 1994)* Likewise, results of mutagenesis 
experiments have been instrumental in developing our latest refinements of the 
outer vestibules and ion-selective portions of K+, Na"*', and Ca^* channels. Here we 
present these latest models and new models of the entire transmembrane and outer 
portions of the inward-rectifying, ROMKl channels. 

Na^ and Ca^-*- Channel Outer Vestibules and Ion-selective Regions 
The mutagenesis esxperiments described below have confirmed our prediction that 
the P segments (also called SSI and SS2 in Na^ channels and HS m K-*- diannels) 
form the inner pore and ion-selectivc filter of K*, Na+, and Ca^^" channels. In our 
first models of Na*** channels (Guy and Seetharamulu, 1986), we suggested that the P 
segments of repeats I to III each form a short ot-helical hairpin, and that the 
negatively charged faces of the second helix form the ion-selective lining of the pore. 
Since then we have considered many other possibilities: e.g., a helix followed by a p 
strand (Guy, 1990; Guy and Conti, 1990), a p hairpin that assembles into a P barrel 
(Durell and Gvy, 1992)« and a helix followed by a random coiled structure in 
channels (Guy and Durell, 1993). Recent analysis of the e^»anded set of P segment 
sequences and new experimental data from Na+ and Ca^* channels have lead us to 
return to the ori ginal helical hairpin motif. However, now the ion-selectzve region 
and tetrodotoxin (TTX) and saxitoxiu (STX) bmding site is postulated to be formed 
by the central residues that link the two helices and/or by the residues that initiate 
the second helix. In addition, the helices are no longer postulated to be oriented 
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paraUel to the pore's axis, but rather are tUted so that they form a cx)ne-shaped outer 

vestibule. , . . - • r«.. ♦u^ 

A representation of P segment sequences is shown m Fig. 1. To simplify the 
comparison of homologous sequences, the numbers have been adjusted to match the 
aligned prolines of the channels and of repeats II and IV of the Ca^^ channels. In 
the Ca2+ channel sequences, the best conserved residues among the four repeats are 
the threonine at position 12p {T12p). the glutamic acid at position 14p (E14p), and 
the tryptophan at position 16p (W16p). It is therefore not surprising that mutagen- 
esis experiments have identified the residues at position Up to be a primary 
determinant of the ion selectivity. In particular, mutation of the positively charged 
K14p of repeat III and the uncharged A14p of repeat IV in Na+ channels to 
negatively chaxged glutamic acid makes the pore properties resemble those of the 
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Figure 1. Sequences of P segments 
from a Na* rat brain channel (Noda 
et al., 1986). a skeletal muscle Ca^* 
channel (Tanabe, Takeshima, Mi- 
kami» Flockerzi, Takahashi, Kan- 
gawa, Kojoma, Matsuo, Hirose, and 
Numa, 1987), a cyclic nUdeotide- 
gated channel (cGMP) (Kaupp, Ni- 
idome, Tanabe, Terada. Bpnifijt, Stflh- 
roer, Cjook, Kangawa, Matsuo, Hirose, 
Miyata, and Numa, 1989), KATl Am- 
bidose channel (Anderson, Hupri- 
and Gaber, 



kar, Kocbian, Lucas, 
1992). EAG K* channel (Warmke, Diysdale, and Ganetzl^, 1991), Shaker voltage-gated K^^ 
channel (Tempcl Papazian, Schwatz, Jan, and Jan, 1987), raSlo Ca^-^-activated channel 
(Butler et al., 1993), a putative K"*" channel from E. coU (Milkman and McKanc-Bridges, 1993), 
and an inward-rectifying K+ channel, ROMKl (Ho, Nichols, Lederer, Lytton, Vasdlev, 
Kanazii^ and Herbert, 1993). Na+ and Ca^* channel segments are numbered beginning 
with the first proline in the repeats H and IV of (V* channels (top) and K*^ channel segments 
beginning whh the first proluie in Shaker P segment (bottom). Residues that occur in three or 
more sequences in this alignment are bold, those that occur in two sequences are in normal 
upper case, and those that occur in only one sequence are in lower case. Alignment of Na+ and 
Ca^+ with JL* channels is difficult and ambiguous. 



native Ca^* channels (Heinemann, Terlau, and Imoto, 1992fl). That is, the permeabil- 
ity of the mutant channels to Na* is blocked by low concentrations of Ca^*, and at 
higher concentrations the channels become permcant to Ca^"^. Likewise, mutating 
any of the E14p's of Ca-+ channels to lysine makes the pore properties similar to 
those of Na+ channels (Kim, Morii, Sun, Imoto, and Mori, 1993; Mikala, Bahinski, 
Yatani, Tang, and Schwartz, 1993; Yang, Ellinor, Sather, Zhang, and Tsien, 1993). 
However, the exact effect of mutating the E14p*s of Ca^+ channels to lysine or 
glutamine depends upon which repeat is altered. Thus, in spite of the sequence 
similarity of the four repeats, the pore is functionally aqmmetric. In addition to this, 
other ocperiments have shown that calcium channels appear to have two Ca^* 
binding sites near the extracellular surface (Ahners and McQeskey, 1984; Hess and 
Tsien, 1984). To explain these results, Yang et aL (1993) suggested that some of the 
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14p glutamic adds fonn one Ca^^ binding site near the extraceUular entrance of the 
pore and others form an additional site farther down inside the pore. In our calduip 
channel models, the asymmetiy and multiple binding sites were approached by 
considering the differences m the central residues which link the helices of the P 
segment hairpins (see Fig. 1). For eioimple, g^ydnes occur at position 13p in repeats 
II and IV and at position ISp m repeats I and IIL Due to their enhanced flesdbflity, 
glycine residues occur most commonly in the coil and turn segments of proteins 
(Chou and Fasman, 1978). In addition, highly conserved gylcines tend to be involved 
in backbone folds that can not be accommodated by the other, more conformation- 
ally restricted residue types (Overington, Donnelly, Johnson, Sali, and Blundell, 
1992). In the P segment model, repeats n and IV have only two residues, T12p and 
G13p, linking the helices; whereas, repeats I and III have four nonhelical residues, 
12p through 15p, linking the helices (see Fig. 2 A). In repeats I and III, residues 
13p-16p are postulated to fonn a type II p tum. This corresponds with the position of 
the glycmes in these repeats, which usually are at the third position of a type n 
tum (Rose, Gierasch, and Smith, 1985). A longer linking segment is also suggested in 
repeat lU by the presence of the proline at position 17p, which is tiie second residue 
of the second helix in the model. Due to their unusual structure, proline residues 
inhibit heUcal formation in the residues preceding them, but often initiate helices in 
the residues that follow them (Chou and Fasman, 1978). 

As shown in Fig. 2 A, the four P segment hairpin structures are assembled, with 
an approximate fourfold symmetiy, about the axis of pore to form a cone-shaped 
outer vestibule. This model was also designed so that the residues which would be 
esqwsed to water in the vestibule are primarily bydrophilic, and the residues whidi 
axe buried between the helices and the other transmend^rane segments are hydropho- 
bia For the reasons described above, the negatively charged E14p carbo)^late 
groups are made to form the selectivity filter, by bemg placed at the narrowest 
portions of the pore. E14p of repeats II and IV form a Ca^^ binding site near the 
extracellular entrance of the pore, and the BI4p of repeats I and m fonn a second 
site farther down in the pore. In addition to differences in the backbone structure, 
side chain differences introduce additional asymmetry; e.g^ in this model, Ca^'*' may 
bind off the pore's axis to D17p of repeat I and E17p of repeat TV in the outer 
entrance of the selecthdiy ater and to D15p of repeat n in the inner entrance. 

Unfortunately, the P segments of the Na^ channel are more diflScult to model 
because their sequences difiter more among the four repeats than they do for the 
Ca2+ channels. Thus, backbone conformations of the P segment from different 
repeats are likely to differ even more than in Ca^^ channels. Fortunately, however, 
more esqperimental data are available fbr Na*^ channels than for Ca^-*- channels. For 
example, not only have mutagenesis experiments indicated that the 14p residues of 
Na*" channels are crucial for ion selectivity (described above), but that the binding of 
TTX and STX depends strongly upon the identity of the 14p and 17p residues in all 
four repeats (Terlau» Heineraann, Stfihmer, Pusch, Conti, Imoto, and Numa, 1991). 
Conversely, mutation of the residues at positions 9p, 13p, 16p, and 18p appear to 
have litde effect on the binding of these toxins. Also, mutation of the F15p residue of 
repeat I to cysteine causes a decreased sensitivity to TTX and STX and an increased 
sensitivity to blockade by zmc (Backx, Yue, Lawrence, Marban, and Tomaselli, 1992; 
Hememan, Terlau, StOhmer, Imoto. and Numa, 1992i; Satin, Kyle, Chen, Bell, 
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Cribbs, Fozzard, and Rogart, 1992). In contrast, mutation of this residue to tyrosine 
has little effect. 

Fig. 2 D displays one of several helical hairpin models we developed for the 
binding of TTX in Na* channels. In accordance with the data described above, each 
model satisfies the criterion of having almost all of the polar atoms of the toKin 




Figure 2« Models of Ca**^ and Na^ channel P segments. (/I) Models of the four putative P 
segment helical hairpins of a Ca-+ channel viewed as from inside the channel. Top is 
cxtraccUular. Lightly shaded portions arc the putative nonhcllcal linking regions, (B) Model of 
the four Ca*'*^ channel P segments as assembled in the pore viewed from outside the ceil. 
(O Model of the four P segments of Na'^ channel P segments viewed from outside the cell. 
(D). Side view of Na* channel P segments with TTX {outlined molecule) bound in the outer 
entrance of the pore. 

forming salt bridges or hydrogen bonds with the I4p and I7p residues of the P 
segments. Unfortunately, the positions of the helices, conformations of the connect- 
ing loops and side chains, and even the order of the repeats remains ambiguous in 
the Na^ channel model. It should be noted that Lipkind and Fozzard (1994) 
proposed an alternate model for the P segments where ^ hairpins are used instead of 
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helical haiipins. However, the models agree in the sequence location of the turn 
segments, which are likely the ion-selective region. 

K'*' Channel Outer Vestilrales and lon-selecttve Regions 

The channels are easier to model than the Na*^ and Ca^"** channels because they 
are comprised of four identical subunits rather than four homologous repeats. Thus, 
it is more reasonable to assume that the channel has fourfold symmetry with respect 
to the pore's axis, which in turn reduces the number of possible conformations. In 
addition, there is considerably more mutagenesis data for the channels than for 
the other types of cbaimels. However, modeling of the channel P segments is 
complicated by the recent findings of Heginbotham, Lu, Abramson, and MacKinnon 
(1994} which suggest that the ion selective region is not formed by any particular 
residue side cham, but rather by the polar atoms of the backbone. This also 
complicates using substitution-type mutagenesis experiments, which just switch side 
chain identities, for determining which particular residues are. involved in ion 
selectivity. Assuming a hairpin motif, another complicating factor is that the segment 
of linker residues is substantially longer for the K"** channels than for the Na'*' or Ca^"** 
channels (see Fig. 1). Finally, the highly conserved glycines at position 15p and 17p in 
the putative ion selectbe linker region are more difficult to model because they have 
considerably more conformational freedom than the other types of residues* How- 
ever, it will be shown in the following sections how tiiese seeming difficulties may 
actually provide dues to the structure of the chaimel. 

Due to the ambiguities, we have developed a number of different models for the 
P segment structure of K*** channels. Despite the differences, they all share the 
following attributes, most of which have been predicted by mutagenesis and/or drug- 
binding experiments. These are (a) the Ip to 9p segment forms an a helbc that is 
oriented such that residues 2p and 9p, which have been identified with tetraethylam- 
monium (TEA) binding from the outside (see below), are accessible from the 
extracellular surface; (b) residue 20p, which is on the other end of the P segment, 
also forms part of the extracellular TEA bmding site; (c) the backbone carbonyls of 
the highly conserved G15p-Y16p*G17p segment forms at least part of the ion-selectrve 
portion of the pore; (d) residues lip, 12p and 14p, m the middle of the P segpient, 
form at least part of the intracellular TEA binding site; and (e) the D18p residue of 
voltage- and calcium-gated channels interacts with positively charged residues of 
charybdotogdn (CTX) when it is bound in the pore. The major differences among the 
models are in the conformation of the lOp to 20p region in the middle of the P 
segment sequence. In particular, the models of the pore structure have included 
four-stranded p barrels (spiraiing right, left and strai^t up and down), short 
eight-stranded barrels (formed from the linkers in ^-hairpin conformations), a 
series of vertical p turns, and an extended structure in which the backbone has either 
a P-strand conformation or a conformation in which the backbone angles of the 
residues alternate between the values for right- and left-handed a helices (with the 
conserved ISp and 17p glycine residues assuming the left-handed conformation). We 
tcntathrely favor these extended models because they satisfy tibie experimentally 
determined criteria described below of having niunerous K"^ binding sites and a 
substantial distance between the intracellular and extracellular TEA binding sites 
and because they provide a role for the highly conserved glycine residues. The 
secondary structure predictions of the first a helix and the random coil for the central 
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region are supported by circular dichroism analysis of the isolated Shaker P segment 
in apolar solvent (Peled and Shai, 1993). 

Sequence homology and mntagenesis data. Identification of which residues and 
regions of the sequence are conserved among different protein species is often useful 
in prediction of protein structure (see Guy and Durell, 1994). Conserved residues 
generally pl^ important structural and/or functional roles in the protein. Elucida* 
tion of these roles is facilitated by correlating the conservation of known functional 
properties of different proteins with the conservation of the sequences. Fortunately 
for modeling efforts, there has been a rapid growth in the acquisition of K"^ channel 
sequences from a variety of families. These families include, voltage-gated, Ca^*- 
activated, Eag, plant, bacterial, and Mg^^-blocked inward-rectifying channels. Be- 
cause all of the channels are by definition selective for K+, comparison of the 
most distantly related sequences is useful for identifying which residues are essential 
for K+ selectivity. For example, 9p-18p is by far the best conserved segment among 
the distandy related K**- channel sequences. Withm this segment, the Mg^'^-blocked 
inward-rectifying channels (ROMKl, IRKI, GIRKl, KATF) have only five residues 
(TlOp, T13p, G15p. Y16p, and G17p) that are identical to those of most other 
channels (see Fig. 1). In addition, many of the residue substitutions arc nonconserva- 
tive; For example, the hydrophobic V9p, Mllp, and M19p or V19p residues in the 
Shaker and Slow-poke sequences are replaced by the hydrophilic E9p, Qllp, and 
R19p residues in the ROMKl sequence, and the negatively charged D18p residue in 
the Shaker and Shw-poke sequences is replaced by the hydrophobic F18p residue in 
the ROMKl sequence. Of the five conserved residues Just described, the two 
threonine residues are also found conserved in the sequences of the cyclic nucleotide* 
gated channels (see Fig. 1). Because the cyclic nucleotide-gated channels are 
nonselective among cations, these threonines are probably conserved for reasons 
unrelated to K+ selectivity. Also, TlOp, T13p, and Y16p are replaced by SlOp. C13p, 
and F16p in the Eag K+ channel sequence, which leaves the two glycines as the only 
residues conserved among all of the K* channel sequences. This suggests that these 
glycine residues are crucial for the K**" selectivity of the channels. 

The importance of the two conserved glycines for K+ selectivity is also indicated 
by .the mutagenesis experiments df Heginbotham et al. (1994), in which every 
position of the central region of the Shaker P segment was mutated one residue at a 
time. It was found that every residue other than G15p and G17p can be substituted 
without substantially altering the channels selectivity for K+ over Na+. Substitution 
of either glycine, however, greatly reduces the channel's selectivity among cations. 
SimUarfy, Heginbotham, Abramson, and MacKinnon (1992) were able to reproduce 
the nonselective properties of the <yclic nucleotide-gated channels by deleting the 
G15p and Y16p residues in the Shaker P segment, which is the same as the natural 
deletion between the two channel sequences (see Fig. 1). In particular, the mutated 
channel was pcrmeant to both Na+ and and was blocked by relatively low 
concentrations of Ca^^. Likewise, the additional mutation of the DlSp residue to 
ElSp resulted in conduction properties similar to those of Ca^* channels, which 
naturally have the a glutamic acid residue at this location and also lack the G15p and 
Y16p residues of channels (see Fig. 1). 

These findmgs have important implications for the structure of the ion-selective 
portion of the pore. As described above, the mutagenesis results indicate that the 
highly conserved 15p and 17p glycine residues are tiie most influential in determining 
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the selectivity. (It should also be noted that the hydrwiyl group of TlOp may also 
play a role in selectivity [Heginbothain et al., 1994]). Because glycine residues lack 
side chains, we modeled the selectivity filter, whidi is taken to be the narrowest 
portion of the pore and in direct contact with the cations, to be formed by the partial 
negatively charged carbonyl o^iygens of the conserved G15p-Y16p<}17p residue 
backbones (Heginbotham et al., 1994). In addition, we make the assumption that the 
glycine residues were probably conserved by evolution because they are able to 
assume a conformation which is energetically imfavorable for other residue types, 
which is generally what is observed for the highfy conserved gyldne residues in 
soluble proteins (Overington et al., 1992). For this reason, the ion-selective, central 
region of the P segment would likely not have a regular a helical or p strand 
conformation. Instead, our models of the K"^ channels have tins region of the P 
segment forming either a random or extended conformation (see Fig. 3). 

Multiple K*** binding sites. Experimental studies have indicated that Ca^^- 
activated channels may possess up to four K+ binding sites in their pore (Neyton 
and Miller, 1988). Assuming that the sites are arranged smgle file, this helps 
constrain the modelmg by requuing a long, narrow pore structure. As shown in Fig. 
3, this is accomplished by having the highly conserved, ion-selective region of the F 
segment in an extended conformation, oriented parallel to the axis of the pore. Two 
conformational extremes are illustrated. Fig. 3 E shows the pore when no ions are 
present, and thus, the amide groups are free to form interstrand hydrogen bonds. 
This four-stranded structure is very atypical, in that most p barrels have at least six 
strands which spiral around the axis in a right-handed fashion. However, the lack of 
interior side chains due to the one-residueTseparated portions of the conserved 
GlSp and G17p residues makes this narrow structure feasible. Fig. 3 F shows the 
other extreme, where the amide groups have rotated by 90'' so that the carbonyl 
o^gens can interact with the ions in the pore. To accomplish this, the torsion angles 
of the residue backbones alternate along the strand between the values for left- and 
right-handed a helices. Once again, this is feasible due to the fact that the conserved 
glycme residues lack side chains and are thus able to assume left-handed helical 
conformations. The V14p and Y16p residues assume the normal right-handed 
helical torsion values. This peculiar conformation for the strands allows for all the 
carbonyl osgrgens to pomt into the pore and form binding sites. Two basic types of 
binding sites are imagined: one with the ion in the plane of four carbonyl oxygens, 
and one with the ion in between two planes of four oxygens each (thus, binding to 
eight oxygens all together). In these models, the pore's lining is envisioned to be a 
highly cfynamic, polarizable structure whose precise conformation is influenced by 
the location and number of ions in the pore. The examples illustrated are the 
narrowest pores that we have modeled. The pore may be made larger by adding 
additional water modelcules. 

TEA binding sites* The prediction of a long, extended pore structure is also 
supported by the data for TEA binding. Many of the different K+ channel pores are 
found to be blocked by TEA at two distinct sites; one accessible from outside the cell 
and the other from inside the cell. From the voltage dependency of binding, it is 
found that the TEA molecule need not traverse much, if any, of the transmembrane 
electric field to reach the extracellular site, and only transverses 20% of the field to 
readi the mtracellular site (Miller, 1991). Although electrostatic repulsion between 
TEA ions at the two sites has been reported (Newland, Adeiman, Tempel, and 



8 



Ion Channels and Genetic Diseases 




Figure 3. Models of Shaker K+ channel P segments. {A) Assembly of four P segments to form 
outer vestibule and ion-seleciive region of the channel; view foim outside the cell Dark gray 
tubes are backbones of a helices; white tubes are random coil backbone. Side chain color code: 
gray=alkyl carbons, pink=unchargcd oxygens, red^^negatively charged oxygens, h'ght 
blue=uncharged nitrogens, yellow =sulfurs. (B) Cross-«ction of outer vestibule and ion- 
selectWe region showing two P segments. View is the same as in A except monomers are 
rotated by 90^ Residues are colored coded accordmg to how well they are conserved among 
distantly related K* channels: the number of residue ^es in all sequences arc: yel!ow=l, 
orange«2, rcd=3, purple=4, blue=5. (C) Side view of P segments illustrating residues where 
mutations alter TEA binding from outside (ffeen) and inside {purple) the cell, TEA molecules 
{dark bhie) are ^15 A apart. {D) Illustration of chaiybdotoxin (CTX) bound in the outer 
vestibule of a Ca-^-activated mslo channel. CTX residues are colored according to how much 
mutations affect the off rate of binding: red=large effect, yel low s= moderate effect, green==little 
or no effect. Crucial CTX residues are labeled black and negatively charged channel residues 
are labeled white. (E an</F) Side view of putatWe ion-selective portion of ROMKl channel in 
two extreme conformations. In £, residues 14p^l6p have a p conformation and the pore is 
filled with water inolecules {pink and white). In F, residues I14p and Y16p have a right handed 
a type conformation, 015p and 017p have a left handed a type conformation and the pore is 
filled with K* ions (cya/i). The actual pore is envisioned to be a highly dynamic structure that 
exists between these extremes most of the time. 0163 and N171 arc residues on M2. All 
backbone atoms arc shown. Color code: black=carbons of lOp-ITp, white ==carbons of other 
segments, pink-uncharged oxygens of backbone or side chain, red = negatively charged side 
chain oxygens, light blue -uncharged nitrogens, cyan = potassium ions, red dashed lines-salt 
bridges, black dashed lines«hydrogcn bonds. 

Aimers, 1992), the fact that they are separated by 80% of the transmembrane electric 
field indicates a considerable separation distance. 

Fig. 3 C shows the positions of residues in our model where mutations alter TEA 
binding at the extracellular (MacKinnon and Yellen, 1990; De Biasi, IQrsch, Drewe, 
Hartmann, and Brown, 1993) and intracellular (Yellen, Jurman, Abramson, and 
MacKinnon, 1991; Hartmann, Kirsch, Drewe, Taglialatela, Joho, and Brown, 1991; 
Choi, Mossman, Aubie, and Yellen. 1993) sites. The 20p residues from the four 
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subunits were positioned near the otracellular entrance to the pore so that when 
th^ are tyrosines or phenylalanines their aromatic groups form much of the TEA 
binding site, as suggested by Heginbotham and MacKinnon (1992). As described 
above, the extended structure of the pore maxtmizes the distance between the 14p 
and 20p residues, which are known to affect intracellular and extracellular TEA 
binding, respectively. The experimental data for the mutation of the 9p residue to 
histidme (De Biasi et aL» 1993) is especially interesting because it alters the effects of 
extracellidar TEA, zinc» histidine reagents, and changes in pH even though it is dose 
in sequence to the Mllp and T12p residues, where mutations affect the binding of 
intracellular TEA In the models, the 9p residue, which is a valine in the Shaker and 
Slow-poke sequences, is buried in the nonpolar interior of the protein near the 
extracellular entrance to the pore. In the inwanl rectifier (ROMKl) the 9p residue is 
a buried glutamic add; however, it is able to form neutralizing hyckogen bonds with 
the amide groups of four proximal residues in the model (see 3 f). Although the 
9p residue is not directly in the pore luimg or TEA binding site, as suggested by 
DeBiasi et al. (1993), it is in a crucial position such that the nonconservative 
mutation to histidine could easily affect the pore's structure and properties, as 
described above. The possibility that this mutation alteis the backbone conformation 
of the channel is suggested by the finding that the mutation of the 9p residue to 
histidine also effects the channel's activation gating kinetics (De Biasi et al., 1993). 
Mutagenesis experiments also indicate that 2p is sufficiently near the extracellular 
TEA binding site to affect TEA binding by an mdircct electrostatic mechanism 
(MacKinnon and Yellen, 1990). In analogy to the Na*^ and Ca^-^ channel models, we 
have modeled the K-f channel Ip-lOp segment as an a helix. This model has the 
advantage of pladng 2p and 9p relathrely near each other on the face of the helbc that 
is postulated to form part of the lining of the outer vestibule. 

CTX bbiding. Docking of a chatybdotoxin (CTX) molecule into the outer 
vestibule model was used to constrain the orientations of the helical portions of the P 
segments that are postulated to foim the outer vestibule of the channel models. CTX 
is a scorpion toxin peptide.that is known to block the extracellular entrance of some 
voltage-gated and Ca^^-acthrated K+ channels (Anderson, MacKmnon, Smith, and 
Miller, 1988; MacKinnon and Miller, 1988). Fortunatety, the three-dimensional 
structure of the peptide in solution has been determined by NMR (Bontems, 
Gfiquin, Roumestand, Menez, and Toma, 1992). Mutation of the positively charged 
K27 of CTX to a neutral residue has been shown to reduce the overall binding aflSnity 
and eliminate the ability of extracellular CTX to compete with intracellular (Paric 
and Miller, 1992). This strongly suggests that the positively charged ammonium 
group of K27 normally reaches into one of the K+ binding sites in the ion-selective 
re^on of the pore. As depicted in Fig. 3 A the K27 CTX side chain is positioned 
inside the ring of the four negatively chaiged D18p residues of the P segments. TTiis 
orientation for the CTX molecule is also supported by the finding that mutation of 
the SIO, W14, R25, M29. N30, R34, and Y36 residues, which are on the same f a<» of 
the CTX molecule as K27, signfficantly affect the dissociation rate from Ca2+- 
activated channels- Likewise, nonconservative mutations of residues on the 
opposite side of the CTX molecule have little effect on the dissociation rate (Stampe, 
Kohnakova-Partenslgr, and Miller, 1994). In the model shown m Fig. 3 D, the 
positively charged R25 and R34 residues of CTX join with the K27 residue to form 
salt bridges with the four DlBp residues. In addition, the Kll and K31 residues of 
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CTX form salt bridges with the E2p residues of adjacent subunits on the periphery of 
the outer vestibule. This interaction is supported by the finding that in the Shako' 
channel the mutation of D2p to asparagine eliminates the binding of CTX (MacKm- 
non and Yellen, 1990). Also in the model, the aromatic F2 and W14 residues of CTX 
bind at the beginning of the second helix of opposite subunits and interact with the 
Y20p and K22p residues of the P segments. Aromatic residues are known to have 
fiavorable interactions with other aromatics (Burley and Petsko, 1985) and with 
positively charged residues in known protein structures. Thus, CTX can dock into the 
model (tf the outer vestibule in a manner in which the interactions between the toxin 
and pore are highly complementary and consistent with the experimental findings. 

Inward Rectifying K*** Channels 

A major shortcoming of the models described so far is that they lack the surrounding 
transmembrane segments (Sl-^6), which would likely influence the predicted confor- 
mations of the P segments. Although we have attempted to make atomic-scale 
models for the surrounding segments of voltage-gated (DureD and Guy, 1992) and 
Ca^+.activated K+ channels, the structures are too large to predict with much 
certainty. However, the situation is simplified for the inward-rectifying and homolo- 
goiisi K* channel proteins, which probably have only two transmembrane helices (Ml 
and M2) per subunit in addition to the P segments. 

Fig. 4 displays the model of the Ml, M2. extraceUular linkers and P segments of 
the ROMKl channel. The procedures used to build the models has been described 
previously (DureU and Guy, 1992). In developing these models, we attempt to 
optimize the following interactions: 

(a) Side chain-water interactions. Polar residues are made to contact other 
polar residues or the solvent, 

(6) Side chain-Iipid interactions. Nonpolar residues are either buried in the 
protein or made to be in contact with the alkyl chains of the membrane. 

(c) Side cham-side chain interactions. A disulfide bride is formed between C121 
and C153. Salt bridges are formed from D108 and EUl to R147, from D116 and 
E123 to R118, and from E152 to H106. Hydrogen bonds are formed between S130 
and E153, YlOO and S135, S135 and S164, and W92 and 0139". A cluster-of aromatic 
side chains is formed by W92, F95, W99, YlOO. F134. Y144, F146, and F148 (see Fia. 
4,Cand£). ^ ^ 

(d) Backbone hydrogen bonds. Virtually all backbone polar atoms are required 
to form hydrogen bonds. Most backbone polar atoms tiiat are witiiin regular 
secondary structures, i.e., a or 3,o heUces or ^ sheets form hydrogen bonds to other 
backbone atoms. At the ends of helices or in "random coil" segments some of these 
atoms may hydrogen bond to side chain atoms, e.g., K107 interacts with the COOH 
terminal of Ml, D108 and T139 with the NHj terramal of tiie first P-segment helix, 
T133 Q139 and Q164 with the COOH terminal of tiiis helix, Q164 to the loop region 
that follows it, E137 with the pore-lining segment and the NH^ terminal of ttie 3,0 
hehx postulated to link the lining to M2, and T119 witii the NH^ terminal of a surfoce 
helix m Uie Ml-P linking segment (see Fig. 4 C). The Ml-P linking segment is 
postulated to have a random coil segment located on the extracellular surface of tiie 
protem where some unpaired backbone polar atoms hydrogen bond to water. Within 
the pore, some backbone ojqrgens may interact witfi ions (see Fig. 3 F). The putative 
random coU segments contain proline and glycine residues that tend to disrupt 
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helices and ^ sheets, whereas the putative helices do not contain these residues 
accept for prolines at some of the NH2 tenimals where they are energetically 
favorable. 

(e) Residue packing. The structure is modeled so that the atoms within the 
protein pack tightly together with no large cavities within the protein (see Fig. 4;d 
andE). 

(f) Torsion angles* Residues are modeled with backbone and side chain torsion 
angles that occur common^ in known protein structures. 





Hgnre 4. Model of tzansmembrane and extraceUular portions of ROMKl channel. (4) View 
from outside and (B) side view of baddione structure. Segments are color coded as: green 
=^1 yellow-Ml-Pe«raceUular linker, orangc=»Pl, red-P2,purplc«P-M2 linker, blue=M2. 
(Q Side view iUustrating encigeticaUy favorable bteractions. Shaded axea color code- 
yeltow=disulfide bridge, red=salt bridges, blue=side chain-side chain fcydrogen bonds, 
qran«side chain^ackbone hydrogen bonds, purple«animatic duster. Gr^ backfoone«P 
segraem, white badcbone=other segments. Polar side chahi atoms colored as in Fig, 3 A. 
(PandE) Space-filled model of channels cross-section through die P segments illustrating 
tj^t pa«*ing. (Z)) Cotered by segments asinA imdB.(E) Colored as in C but with aromatic 
side diain carbon atoms colored purple in P and lavender m other segments. (F) Side view 
fllustratmg extent of sequence conservation. Yellow residues are identical in aH K+ channel 
fanuhes except Eag. Color code for number of residue types in mulUsequence alifinment of 
mward recnfying K+ channels: orange=l. red«2, puiplc«3, bluc=4, gray at5. 

(g) Sequence conservation. Special attention is placed on whether a residue is 
conserved m the family of sequences (Ouy and Durell, 1994). As is generaHy 
observed for known soluble and transmembrane protein structures, the poorly 
wnseiyed polar and nonpolar residues are put m contact with cither the solvent or 
the lipid, respecth^ely (see Fig. 4 f). In contrast, the conserved residties are used for 
unportant structural and/or functional purposes. For example, the putative selectiv- 
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ity filter region is the most highly conserved part of the protein among all K+ 
channels. In addition, many of the energetically favorable side chain-side chain and 
side chain-backbone interactions described above involve residues that are highly 
conserved within the family of channels that indudcs ROMKl. 

The most noteworthy aspect of the ROMKl model is that, beginning with the 
backbone stnicture of P segments developed from veiy distantly related K+ channels, 
a model that satisfies all our criteria described above could be made of all the protein 
segments surrounding the P segment Although the model of the pore formed by Ml 
and M2 in the intracellxilar half of the transmembrane region is more ambiguous, one 
aspect is consistent with recent experimental findings. N175 has been shown to be 
invohrcd in the voltage-dependent blockade of ROMKl by magnesium (Lu and 
MacKinnon, 1994). This effect has been postulated to be caused by an electrostatic 
effect. In our models, this residue is on the lining of the pore just beyond the end of 
the P segment where one might e^qpect magnesium to bind (see Fig. 3 £). 



Summaiy 

The structure determination of voltage-gated channels by the combination of 
molecular modeling and mutagenesis experiments is a long term, iterative process. 
As such, the models should be considered as a work in progress, with changes 
expected as more data becomes available. The priroaxy role of the models is that they 
assimilate the known data and provide ideas for further experiments to elucidate the 
real structures. Although the models presented here have already gone through two 
or three cycles of development and testing, many aspects remain tentative. Perhaps . 
the most significant result so far is that the P segment was experimentally confiiroed 
to form the ion-selective part of the channel. In a subsequent cycle of testing and 
modcfing, the specific residues responsibilhy for Na+ and Ca^"*" selectivity have been 
identified and the selectivity filter of K+ channels is now predicted to be formed not 
by the side chains, but rather by the carbonyl oxygens of the conserved Gly-iyr-Gly 
sequence backbone. As another example, the 9p residue of the P segment of K* 
channels was originally modeled as either being buried in the protein or accessible 
from inside the cell only. However, once mutation of this residue to bistidine was 
found to affect blockade by extracellular TEA, protons, Zn^* and histidme reagents 
(PeBiasi et al., 1993), the models were updated to have this and the hydrophilic 
re»dues in the first part of P form a helix that comprises part of the extracellular, 
outer vestibule of the pore. AVhOe this modf was used also for Na*** and Ca^*** pore 
models (see Fig. 2) where the putative helices are amphipathic, it remains to be 
verified. Modeling of the size and shape of the outer vestibule of K^ channels was 
also aided by the data for the binding of CTX in the extracellular entrance to the 
pore. Similarly, e3q)eriments with peptide toxins such as p. and fl conotoxins may 
prove useful in modeling the outer vestibules of the Na+ and/or Ca^^ channels. 

While important advances have been made, it is important to realize that these 
approaches are still very new. In the future we are likely to see improvements on both 
the theoretical and e3q)erimental sides which will greatly advance the procesis. For 
example, as computational power increases and as methods to calculate protein- 
ligand interactions improve, it will likely become possible to accurately calculate the 
binding affinities of toxins such as TTX and STX to the model pore structures of Na*** 
and Ca^-*- channels and compare these with esqperimental values. This should help 



ModebofNa\ 0^\andK* CkannOs 



13 



discriminate between different pore motifs. Similarly, computational studies of the 
free energy of ion permeation, such as pursued by Roix and Karplus (1994) and Chiu, 
Novotny, and Jakobsson (1993), may also greatiy assist in understanding the struc- 
ture of the pore and the mechanisms that enable ion selectivity. 

Of coixrse, what we need most is hard structural data; the type obtained by x«ray 
crystallography, NMR and electron ciyomicroscopy. Unfortunately, It is unlikely 
that high resolution structures of voltage-gated channels will be obtained anytime 
soon. One reason is that the esqpression systems needed to produce the necessary 
quantity of purified protein have oxily recendy been developed (Santacruz-Toloza, 
Perozo, and Papazian, 1994; Spencer, Taken^, Aiyar, Ngyen, Grissmer, Gutman, 
and Chandy, 1994), and many additional difficulties remain to be overcome. How- 
ever» it may be possible to use synthesized peptide analogues of the P segment with 
or without other segments to make membrane-bound crystals or water soluble. In this 
case, structural models will be essential for deciding how to tether the P segments 
together, and how to make the analogues long enough to span the bilayer or water 
soluble. 
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It is known that over 50 transport systems exist which exhibit putative ATP binding 
domains (Hlggins, 1992). Collectively these are referred to as the ABC transporter 
supergene family. They are referred to also as M-type ATPases (Pedersen and 
Amzel, 1993) because of their structural similarity to the multidrug resistance 
protein (MDR*1 or P-glycoprotein), and in other cases as traffic ATPases (Doige and 
Fern>-Lu22a Ames, 1993), Among this class of transporters are MDR-1, MDR-2, 
CFTR, TAP 1 and 2, STE6, and over 25 bacterial transporters. Most have two 
nucleotide domains. 

A major effort of this laboratory has been to understand structural-functional 
relationships within the nucleotide binding domains of transport ATPases. Specific 
caily, the nucleotide bindnig domains of the CFTR protein (cystic fibrosis transmem*- 
brane conductance regulator) and the mitochondrial ATPase/ATP synthase com- 
plex (an F-type ATPase) have been extensively studied. To this end, we have 
overexpressed these domains in Escheridtia coU (Oarboczi, Hullihen» and Pedersen, 
1988; Lee, Garboczi, Thomas, and Pedersen, 1990; Ko, Thomas, and Pedersen, 
1993), chemically ^thesized key regions containing the nucleotide binding consen- 
sus sequences (Thomas, Shenbagamurthi, Ysem, and Pedersen, 1991; Ko, Thomas, 
and Pedersen, 1994), and used both mutational analysis (Garboczi, Thomas, and 
Pedersen, 1990; Thomas, Garboczi, and Pedersen, 1992a) and biophysical ap- 
proaches (Thomas, Shenbagamurthi, Sondek, HuUhexi, and Pedersen, 1992c; Chuang, 
Gittis, Abeygunawardana, Pedersen, and Mildvan, 1994) including circular dichro- 
ism spectroscopy and NMR, to relate structure to function* In addition, ^qperiments 
are underway in an attempt to crystallize one of these nucleotide domains. 

Work described here will focus specifically on the CFTR protein as mutations 
within both of its nucleotide domains have been linked to cystic fibrosis. Progress 
made m this laboratory in the study of these two nucleotide domains will be briefly 
described. 

CFTR is an integral membrane protein (Riordan, Ronmiens, Kerem, Alon, 
Rozmahel, Grzelczak, ZUenski, Lok, Plausic, and Chow, 1989) comprised within a 
single polypeptide chain of 1,480 amino acids (Fig. 1). The five major domains 
include two nucleotide binding folds (NBFl and NBF2), a regulatory domain (R), 
and two traimnembrane spanning regions (TMSs). The latter, at least in part, form a 
Q- channel that is believed to require, for optimal function, both ATP hydrolysis 
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(Anderson. Berger, Rich. Gregoiy, Smith, and Welsh, 1991) mediated I^NBFl and 
phospboi}dation of the R domam mediated by protein kinase A and/or other ceQular 
kinases (Tabcharani, CSianfe Riordan. Hanrahan, 1991; Cheng, Rich, MarshaD, 
Gregoiy. Welsh, and Smith, 1991). Over 200 different mutations in tiie gen^ 
encoding CFTR induce amino add changes in the protein which in turn canse qrstic 
ffl^is (Tsui, 1992). Most cause mild forms of the disease, whereas others, like 
AF508 C3TR, result m severe fams of the disease. Unfbrtunately, -^90% of all 
cystic fibrosis patients have been reported to have at least one AF508 allele (Kerem, 
Corey, Kerem, R<»nmens, Mariewicz. Levinson, Tsui, and Dune, 1990). 

NBFl spans a ~155 amino add region of CFTR (residues 433-588) and fa- 
dudes a Walker A (GX4GKT/S), a Walker B (RXj^D), and so-called linker or C 
consensus (LSXGXR/K). The Walker A and B consensus are found m many 
nudeotide binding proteins (Walker, Saraste, Runswick, and Gay, 1982) indudfag 
adenylate kinases, ATP synthases, the RecA protein, and all members of the ABC 
transporter superfanxily (Higgins, 1992). The C consensus, however, appears to be a 
unique feature of the ABC transports siqterfomily (Shyamala, Baicfawai, BeaU and 
Ames, 1991). 




Flgnre I. Presentatfcm of the primary structure of the CFTR molecule. The le! 
of the five donuins and the location of the AF508 mutation are depicted. 



As no three-dimensional structural data is available for CFTR, or for any other 
member of the ABC transporter supeifamily, NBFl has been assumed to fbid in a 
manner shnilar to adenylate kinase, the crystal structure of which is known (Schulz, 
eanga, Maix, and Sdiinner, 1974; Sachsenheimer and Schulz, 1977) Fig. 2 A 
depicts the structural fold of adenylated kinase (Schulz et aL, 1974; SadiUnheimer 
and Schulz, 1977). The region shown in black consisting of p strand/o helix/ p strand 
secondary structural organization is known to partidpate in binding ATP (Fry Ruby 
and MUdvan. 1986; MQdvan, 1986). Within this regkm the A consensus is l^tod 
near the mterfece of the o helix and the loop (P loop). TTie A consensus is believed to 
interact directly with one or more of the three phosphate groups of ATP while the a 
hela may partdpate, m part, in binding the purine ribose moiety (Fiy et aL, 1986: 
MUdvan, 1989). TTie B consensus is located in a different region of adenylate kiiiase 
but, nevertheless, Ues near the A consensus, and m some ATP binding piotems is 
beheved to mteract with Mg*+ (Stoiy and Steitz, 1992). 

'J^^J'^J^ ^ ^ consensus of NBFl would be predicted to lie 

witfrni the adenylate kmase fold. Significantly, its position resides downstream from 
the B consensus. Therefore, in three-dimensional space the A. R and C consenaun 
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regions are predicted to lie near one another. The C consensus evidently does not 
contribute to nucleotide binding per se. 

Of the 19 major mutations within NBFl that cause cystic j&brosis (Tsui, 1992), it 
is interesting to note that 13 lie within or near consensus regions 2 B). Ilie 
remaining six mutations lie within or near a hydrophobic 0-strand insert where the 
major disease causing mutation AFS08 occurs. Although this region was originally 
believed tt) be a helical in character (Hyde, Gill, Hubbard, and Higgms, 1990), four 
different programs for predicting secondary structure have been shown more re- 
cently to report ^trand character (Thomas et al., 1991). It is possible that in the 
folding pathway, a p strand is first to form which is then converted to an a-helix in the 
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Figure 2, Adenylate kinaselike fold predicted in all members of the ABC transporter 
superfamily. Note the close pronmity of the A, B» and C consensus to the ATP binding pocket, 
and CFTR to F508. Some invest^tors suggest that the rather hydrophobic insert containing 
F508 may interact with the membrane. (B) Mutations m NBFl that cause ^stic fibrosis. Boxes 
enclose mutations with ccHisensus niotife.i 



final folded structure. Whether or not this segment fits compactly within the 
adenylate kinaselike fold in NBFl or projects out fixnn it and into the membrane as 
speculated by some workers (Hyde et al., 1990; Baichwai, liu, and Ames, 1993; 
Arispe, Rojas, Hartman, Sorscher, and Follardy 1992) remains an interesting concept 
to be tested esqperimentally. 

NBF2 spans a '-les amino acid region of CFTR (residues- 1219-1386) and 
includes the Walker A and Walker B consensus r^ons, as well as the C consensus. 

^ WearBiu8tOBnpletingthehoinolQgymodelmgofNBFlandNBF2u 
x-ray structures. 
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NBF2 exhibits considerable amino add sequence homology to NBFl (Smit, Wilkin- 
son, Mansoura, Collins, and Dawson, 1993), and like NBFl, there are a number of 
mutations within the domain that result in <^tic fibrosis (Tsui, 1992). However, none 
are as severe as that produced by the AFS08 mutation in NBFl. 



Results and Conclusions 

In this section, experiments conducted over the past four years on the two nucleotide 
domains of the CFTR protein are briefly described. Experimental details and a more 
detafled discussion of most of this work can be found in (Ko et al., 1993, 1994; 
Thomas et al,, 1991, 19926^- Thomas and Pedersen, 1993). 

Preparations of NBFl and NBF2 Peptides, and an NBFl Fusion Protein 

When we commenced this work, the CFTR protein had not been isolated, and there 
was no direct evidence that the two predicted nucleotide domains, NBFl and NBF2 
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Figure 3. Peptides corresponding to the indicated regions of NBFl and NBF2 of CFTR. 
These were chemically synthesi2ed on a peptide synthesizer (model 430A, Applied Biosys- 
tems. Inc.) (Thomas et al., 1991; Ko et al., 1994). After hydrogen fluoride deavage &om the 
solid phase* the peptides were purified reverse phase HPLC chromatography on a Waters 
Qg column. The Walker A consensus motif and FS08 are underlined. 



were in fact ATP binding domains. For this reason we designed, chemically synthe- 
sizedy and purified both a 67 amino add segment (F-67) of NBFl (Thomas et al^ 
1991)» which included the Walker A consensus motif and phenylalanine 508, and a 51 
amino acid segment (P-51) of NBF2 (Ko et al, 1994) which also included the Walker 
A consensus motif (Fig. 3). These peptides correspond to the central core region of 
the adenylate kinase model where ATP is known to bind. 

In addition to the above, we employed recombinant DN A technology to 
overexpress in £1 coU the con^lete NBFl domain (F433-^S589) in fusion with the 
maltose binding protein (MBP) (Ko et al., 1993). The MBP protein was used both to 
facilitate purification on an amylose column and to provide a soluble MBP-NBFl 
fusion product that might be suitable for cxystallization. All three proteins, P-67, 
P-51 and MBP-NBFl, were subjected after preparation to a variety of purification 
tests as indicated in Table I. They were all judged to be > 95% pure, and therefore, 
suitable for functional analysis. 
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Secondary stmctnre. The three CFTR segments (P-67 from NBFl, MBP-NBFl, 
and P-Sl from NBF2) were all shown by circular diduoism spectroscopy to exhibit 
significant seoondaxy structure (Table II). Interestingly, the P-67 and P-Sl peptides, 
which both span regions of NBFl and NBF2 containing the Walker A consensus 
motif, distribute their secondaiy structural elements quite differently with PhS7 
exhibiting a high content of ^-strand character relative to a helix and with P-Sl 
exhibiting a high content of a-belical character relative to p strand Neither P-67 nor 
P-Sl showed an obvious propensity to form multimolecular structures, whereas the 
MBP-NBFl construct, and not MBP alone, showed a propensity to form soluble 
organized polymeric structures which under high salt conditions partially dissociate 
and form miccocrystals (see Fig. 8 of Ko et al., 1993). Gearly, a three-dimensional 
structure of the wild-type and AF508-NBF1 would be of particular significance in 
reveaUng the chemical details characterizmg that region of NBFl where the critical 
qrstic fibrosis causing mutation lies. The formation of microcrystals of the MBP- 
NBFl fusion protein represents an encouraging first step. 

TABLE I 

Yidd and Criteria of Ptorily of NBFl and NBF2 Components of CFTR 

CFTR component Region Yield . Criteria of purity 

P«^7 from NBFl R450-*P516 6-10 mg^ HPLC, SDS-PAGE, AA, arid NHj tcmiinus 

sequence analysis 

MBP-MBFl F433->S589 -25mg» HPLQ SDS-PAGE, AA» and NHj teraiinus 

se({uence analysis 

P.51 from NBF2 E1228 T1278 6-10 mg* HPLC, SDS-PAGE, AA, NH2 terminus 

sequence and mass spectral analysis 

"From 300 mg of crude starting materiaL 

♦Per liter cell culture. 

AAt amino acid composition analysis. 

See Ko et aL (1993» 1994) and Thomas et al. (1991) for details. 

ATP binding. All of the above CFTR components prepared in this laboratory 
were found to bind the fluorescent nucleotide analogue trinitrophenyl-ATP (TNP- 
ATP) which could be displaced with ATP. Several control proteins including insulin 
fiajled to bmd TNP-ATP, Table III shows that dissociation constants (KdS) for ATP 
were all in the low mM range (030-1.8 mM) m analogy to numerous intracellular 
ATP-dependent enzymes. All CFTR components also bound TNP-ADP and to a 
much lesser extent TNP-AMP. Mg^* was not necessary for nudeotide bmding nor 
did it appreciably affect values for all of the above CFTR components* 

The studies described above demonstrate that NBFl and NBF2 do mteraci with 
bofliATPandADP. 

Effect of the AF508 mutation on the structure and function of the NBFl peptide 
and on the MBP-NBFl fiision protehi. About 90% of aU cystic fibrosis patients have 
been reported to have at least one AFS08 CFTR allele (Chuang et aL, 1994). 
Therefore, it was important to prepare and characterize the same NBFl segments 
described above, Le., P-67 and MBP-NBFl but lackmg phenylalanme 508. The 
following peptide called p.66 lacking F508 was prepared first employmg a peptide 
synthesizer (model 430 A, Applied Biosystems, Inc., Foster City, CA) using again the 
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same solid phase method used for (Thomas et aL, 1991)- No obvious problems 
were encountered in either the synthesis or subsequent reverse phase HPLC 
purification of P-66. A product judged > 95% pure by SDS-PAGE, HPLC, ammo 
acid, and sequence anal^s was obtained. The yield was 6-10 mg from the crude 
starting material, in the same range as that obtained for the P-67 wild-type peptide. 
MBP-NBFl lacking F508 was then prepared by PGR using the C-1-1/5 clone as the 
template (Ko et aL, 1993). The fusion protein after overexpression in E. coU, was 
purified to apparent homogeneity on an amylose cohimn. The yield of (AF508) 
MBP-NBFl was ^^25 mg/liter of cell culture, in the same range as that obtained for 
the MBP-NBFl wild-Qrpe protehu The P-66 peptide lacking F508 and the (AF508) 
MBP-NBFl construct were then analyzed in detail for structural and functional 
differences from the wild-type proteins. 

P-66, physical characterizatioiu The circular dichroism spectra presented in 
Fig. A A show clearly that the P-66 peptide bearing the AF50S mutation has less 
ordered structure than the wild-type P-67 peptide. Deconvolution of the two spectra 
using four diffisrent programs (Thomas et aL, 19926) indicated diat deletion of F508 



TABLE n 

Bdathrc Seomdavy Stnictoral Elements of NBFl and NW2 Components 
of CFTR as Determined by Circular DIchnton Spectroscopy 



CFIK component 


Region 


Secondary structural element* 
a helix p strand Other* 








^ of total 


P-67 from NBFl 


R450-*R516 


<10 


80 lD-15 


MBP-NBFl 


F433 S589 


40 


24 36 


P-Sl&omNBF2 


E1228-«^T1278 


20-28 


6-11 61^74 



*Deconvolution of the spectral data was carried out using the Prosec program. 
Random coil + turns. 

See Ko et al. (1993, 1994) and Thomas et al. (1991) for details. 



from P-67 to give P-66 is accompanied by a loss of p-sheet secondary structure and a 
corresponding gain of random coil. Significantly, four different programs for predict- 
ing secondary structure indicate that FS08 lies within a ^-strand region (Thomas et 
al., 1991). Therefore, one of the simplest interpretations of these studies is that 
deletion of F508 destabilizes a p-sheet structure and unfolding occurs. 

The circular dichroism e9q)eriments were reproduced in many different «q}eri* 
ments and at many different dilutions of P-66 and P-67, and suggested that deletion 
of F508 produces a less stable peptide. For this reason we exanuned the efiEect of urea 
on the folding of P-67 and P-66, a study which revealed that the P-66 peptide bearing 
the FS08 mutation unfolds (half maximally) at a significantly lower urea concentra- 
tk>n (Fig. 4B). In other studies not presented here, we have shown also that the P-66 
peptide unfolds (half maximally) at a temperature IT below that of P-67. 

P-66, ATP binding. P-66 binds TNP-ATP as well as the wild-type peptide P-67. 
Thus, the localized disruption of structure within the F508 region evidently has little 
or no effect on the ATP binding region containing the Walker A consensus. 
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However, when nxicleotide binding is measured in the presence of 4 M urea the 
wild-type peptide retains significant TNP-ATP binding capacity while the mutant 
peptide has lost this function (TTionias et aL, 1992). It will be noted m Fig. 4 that at 
4 M urea the P-67 peptide retains about one third of its original secondazy structure 
whQe the P-66 peptide is completely unfolded. 

Jn other studies we found no detectable capacity for these peptides to catalyze 
the hydrolysis of ATP. 

ModeL The structural and functional studies on the wild-type P-67 and mutant 
P-66 peptides led us to consider the chemical and s tructur al events that may take 
place when F508 is deleted fiom a ^ sheet region of CFTR. As Olustrated in Fig. 5, 
one possibility is that a register shift will occur requiring reorientation of some amino 
acid side chains from one face of the p sheet to the other- Alternatively, a **p bulge" 
could form maintaining the residues on each face of the p sheet. FinaJly, neither of 
these uew structures may be as stable as the original structure and, thus, wUl not 
spontaneously fold under physiological conditions. We believe that the latter possibU- 
ity is most consistent with our structural and functional studies. 



TABLE m 

Relative Nacleotide-blnding Properties of NBFl and NBF2 Components 
erf CFTR 



CFTR component 


Region 


Dissociadon 

constant 
(Kd) for ATP 


Nudeotide spediicity 






mM 




P-67 from NBF-1 


R450-^R516 


059 


TNP-ATP > TNP-ADP > TNP^AMP 


MBF-NBFl 


F433 S589 


1.80 


TNPATP > TNP-ADP > TNP-AMP 


P-51fromNBF2 


E1228-».T1278 


0.46 


TNP-ATP > TNP-ADP > TNP-AMP 



See Ko et ai. (1993, 1994) and Thomas et al. (1991) for details. 



hypothesis. Early studies on the membrane trafBckmg of intact CFTR within 
the cell were at first puzzling. Some investigators reported that the AFS08 mutant 
protein underwent biosynthetic arrest and faOed to travel to the plasma membrane 
(Rich, Anderson, Oregoiy, <3ieng, Paul, Jefferson, McClIann, Klinger, Smith, and 
Welsh, 1990; Anderson, Rich, Gregoiy. Smith, and Welsh, 1991; Denning, Os- 
tegaard, and Welsh, 1992), whereas other workers reported that the mutant protein 
does travel to the plasma membrane in functional form (Li, Ramjeesingh, Reyers, 
Jensen, Chang, Rommens, and Bear, 1993; Drum, Wilkinson, Smit, Worrell, Strong, 
Frizzell, Dawson, and Collins, 1991). Our studies on purified components, which 
showed that the AF508 mutant peptide (P-66) is less stable, led us to propose that 
protein unfolding may be the molecular basis of most cases of cystic fibrous (Thomas, 
Ko, and Pedersen, 1992b). Thus, we sug^ed that at 3T the AF508 mutant protein 
may be unstable and undergo biosynthetic arrest in some animal cells because of the 
inability to fold mto a compact structure whereas at 2T, ie., in insect cells (Sf9), the 
mutant protein may be able to refold into a stable functional protein and travel to the 
plasma membrane. (In the latter case, it is predicted that deletion of FS08 would be 
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accompanied by the formation of a "remodeled" p-sheet region, two possibilities of 
which are illustrated in Fig. 5). 

Consistent with the above hypothesis Denning, Anderson, Amara, Marshall, 
Smith, and Welsh (1992) demonstrated that reduced temperature facilitates the 
processing of the AFS08 CFTR in 3T3 fibroblasts and C127 cells, resulting m the 
appearance of cAMP-reguIated channels in the plasma membrane. In a subse- 
quent review (Thomas and Pedersen, 1993), we called attention to the possible use of 
chaperones in the treatment of cystic fibrosis. 

(AFS08) MBP-NBFl, physical and fiwctional characterization. In a thorough 
study (Ko ct al., 1993) we found no obvious differences in the physical and functional 
properties of wild-type MBP-NBFl and (AF508) MBP-NBFl. These studies in- 
cluded (a) Secondary structural analysis by circular dichroism spectroscopy; (b) 
Susceptibility to protease digestion; (c) TNP-ATP binding and its competitive 
displacement by ATP; (d) Effect of urea on the nucleotide bmduig funcdon; (e) 
HPLC molecular sieve chromatography; and (/) Propensity to form pofymeric 
structures and microcrystals. 
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Figure 4, (4 ) Comparison of the circular dichroism spectra of the wild type and P-66 peptides 
indicating differences in secondary structure. The mean residue molar elliptidly is plotted vs 
wavelength (reprinted with pmnlsston^JoumalirfBipeneFgeticsandBi^ (B) Com- 

parison of the unfolding patterns of P-67 and P-66 in urea based on changes in the mean 
residue molar ellipticity at 218 nm and reflecting clear differences in the relative stabilities of 
the two peptides, (for details see Thomas et al., 1992^ Thomas and Pedeisen. 1993). 

These studies together with the additional finding that the maltose binding 
protein remains associated with NBFl or AF508-NBF1 even after cleavage of the 
fusion junction, indicates that the obvious structural, stabili^, and fiinctional differ- 
ence between mutant and wild-type domains observed for the P-67 and P-66 
peptides are not evident. This apparent discrepancy may be explained when one 
considers that P-67 and P-66 may represent folding mtermediates on the pathway 
toward the native state whUe the MBP-NBFl fiision proteins might be the finafly 
folded structures in which the presence of one deleted amino add fails to make a 
large difference. Therefore, if one were to obtain a crystal structure of the fusion 
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proteins described here, it may be possible to deduce the minimal structural change 
that occurs when AFS08 is deleted. We intend to work toward this goal. 

Findings briefly summarized above are significant for several reasons. First, 
studies with P-67 and F-66 provided the first experimental evidence indicating that 
the major t^tic fibrosis causing mutation, AF508, alters the structure and stability of 
NBFl. Secondly, these studies provide chemical insist into those changes that may 
take place in NBFl when F508 is deleted. Thirdly, MBP or other proteins may be 




Figure 5. The possible effects of deleting F508 fit)in a p-shcct region, (reprinted with 
pennission from the Journal of Biological Chemistry), The p sheet may reforai into either 
structure 1 or 2. However, if the latter are not themKxfynamically stable structures, th^ would 
not torn as depicted in 3. The experimental data obtained on the P-66 peptide relative to the 
control P-67 peptide indicate that possibility 3 is correct (Thomas et al., 19922>). 



considered to minunize the effect of the deletion of F508 so that the mutant protein 
is more similar to that of the wild-type protein. FinaUy, studies with the MBP-NBFl 
and (AF508) MBP-NHFl fusion proteins have provided the first CFTR systems to 
date amenable to crystallization. 

Other studies In progress. The folding pathway of CFFR is being studied in 
greater detail to define those components of the cell involved in this process. Also, 
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studies are being conducted to establish whether NBFl exhibits the capacity to 
byvirolyze ATP and to interact witli biological membranes. ^ 
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